Corpus: slv_newscrawl_2015_1M

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 59053 p-
2 31526 s-
3 26159 n-
4 22470 o-
5 21834 z-
Top Character Bigrams
word rank frequency n-gram
1 25478 pr-
2 22188 po-
3 12786 za-
4 12157 na-
5 10305 ne-
Top Character Trigrams
word rank frequency n-gram
1 12143 pre-
2 6878 pri-
3 5302 raz-
4 4199 pro-
5 3330 pos-
Top Character 4-Grams
word rank frequency n-gram
1 2055 pred-
2 1384 zurn-
3 1286 pres-
4 1206 zani-
5 1200 samo-
Top Character 5-Grams
word rank frequency n-gram
1 1384 zurna-
2 1100 zanim-
3 851 Slove-
4 790 proti-
5 514 inter-
3641 msec needed at 2018-03-23 23:07